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ON P VERSUS NP AND INFINITE TURING MACHINES 


FRANK VEGA 


ABSTRACT. P versus NP is considered one of the great open problems of sci- 
ence. This consists in knowing the answer of the following question: Is P equal 
to NP? This incognita was first mentioned in a letter written by John Nash 
to the National Security Agency in 1955. However, a precise statement of the 
P versus NP problem was introduced independently in 1971 by Stephen Cook 
and Leonid Levin. Since that date, all efforts to find a proof for this huge 
problem have failed. It is currently accepted that a positive answer for P ver- 
sus NP would have tremendous effects not only in computer science, but also 
in mathematics, biology, etc. We define a problem that we call Simple Subset 
Product. We show Simple Subset Product is an NP-complete problem. More- 
over, we prove Simple Subset Product is also in P. In this way, we demonstrate 
that P = NP. 


INTRODUCTION 


P versus NP is a major unsolved problem in computer science [3]. It is consid- 
ered by many to be the most important open problem in the field [3]. It is one of 
the seven Millennium Prize Problems selected by the Clay Mathematics Institute 
to carry a US$1,000,000 prize for the first correct solution [3]. 

In 1936, Turing developed his theoretical computational model [1]. The deter- 
ministic and nondeterministic Turing machines have become in two of the most 
important definitions related to this theoretical model for computation. A de- 
terministic Turing machine has only one next action for each step defined in its 
program or transition function [7]. A nondeterministic Turing machine could con- 
tain more than one action defined for each step of its program, where this one is 
no longer a function, but a relation [7]. 

Another huge advance in the last century has been the definition of a complexity 
class. A language over an alphabet is any set of strings made up of symbols from 
that alphabet [2]. A complexity class is a set of problems, which are represented as 
a language, grouped by measures such as the running time, memory, etc [2]. 

In the computational complexity theory, the class P contains those languages 
that can be decided in polynomial time by a deterministic Turing machine [5]. The 
class NP consists in those languages that can be decided in polynomial time by a 
nondeterministic Turing machine [5]. 

The biggest open question in theoretical computer science concerns the relation- 
ship between these classes: Is P equal to NP? In 2002, a poll of 100 researchers 
showed that 61 believed that the answer was not, 9 believed that the answer was 
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yes, and 22 were unsure; 8 believed the question may be independent of the cur- 
rently accepted axioms and so impossible to prove or disprove [4]. All efforts to 
solve the P versus NP problem have failed [7]. Nevertheless, we prove P = NP. 


1. ‘THEORETICAL NOTIONS 


Let X be a finite alphabet with at least two elements, and let X* be the set of 
finite strings over X [1]. A Turing machine M has an associated input alphabet Y 
[1]. For each string w in X* there is a computation associated with M on input w 
[1]. We say that M accepts w if this computation terminates in the accepting state, 
that is, M(w) = “yes” [1]. Note that M fails to accept w either if this computation 
ends in the rejecting state, or if the computation fails to terminate [1]. 

The language accepted by a Turing machine M, denoted L(M), has an associated 
alphabet X and is defined by 


L(M) = {w € XX : M(w) = “yes” }. 


We denote by tx (w) the number of steps in the computation of M on input w [1]. 
For n € N we denote by Tm(n) the worst case run time of M; that is 


Ty(n) = max{ty(w):w € 2"} 


where X” is the set of all strings over © of length n [1]. We say that M runs in 
polynomial time if there exists k such that for all n, Ty(n) < nf + k [1]. 


Definition 1.1. A language L is in class P if L = L(M) for some deterministic 
Turing machine M which runs in polynomial time [1]. 


We state the complexity class NP using the following definition. 


Definition 1.2. A verifier for a language L is a deterministic Turing machine M, 
where 

L={w:M(w,c)= “yes” for some string c}. 
We measure the time of a verifier only in terms of the length of w, so a polyno- 
mial time verifier runs in polynomial time in the length of w [8]. A verifier uses 
additional information, represented by the symbol c, to verify that a string w is a 
member of L. This information is called certificate. 


Observe that, for polynomial time verifiers, the certificate is polynomially bounded 
by the length of w, because that is all the verifier can access in its time bound [8]. 


Definition 1.3. NP is the class of languages that have polynomial time verifiers 
[8]. 


A function f : 2* + X* is a polynomial time computable function if some 
deterministic Turing machine M, on every input w, halts in polynomial time with 
just f(w) on its tape [8]. Let {0,1}* be the infinite set of binary strings, we say 
that a language Lı C {0,1}* is polynomial time reducible to a language Lo C 
{0, 1}*, written Lı <p Lo, if there exists a polynomial time computable function 
f : {0,1}* + {0,1}* such that for all x € {0,1}*, 


x € Li iff f(x) € La 


where iff means “if and only if”. An important complexity class is NP-complete 
[5]. A language L C {0,1}* is NP-complete if 
(1) LE NP, and 
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(2) L’ <p L for every L’ € NP. 

Furthermore, if L is a language such that L’ <, L for some L’ € NP-complete, 
then L is in NP-hard [2]. Moreover, if L € NP, then L € NP-complete [2]. No 
polynomial time algorithm has yet been discovered for any NP-complete problem 
[3]. 

A principal NP-complete problem is SAT [5]. An instance of SAT is a Boolean 
formula which is composed of 

(1) Boolean variables: £1, £2,..., En; 

(2) Boolean connectives: Any Boolean function with one or two inputs and one 
output, such as A(AND), V(OR), —(NOT), >(implication), (iff); 

(3) and parentheses. 

A truth assignment for a Boolean formula ¢ is a set of values for the variables in 
@. A satisfying truth assignment is a truth assignment that causes è to be evaluated 
as true. A formula with a satisfying truth assignment is a satisfiable formula. The 
problem SAT asks whether a given Boolean formula is satisfiable [5]. 

Another NP-complete language is 3CNF satisfiability, or 3S. AT [2]. We define 
3CNF satisfiability using the following terms. A literal in a Boolean formula is an 
occurrence of a variable or its negation [2]. A Boolean formula is in conjunctive 
normal form, or CNF, if it is expressed as an AND of clauses, each of which is the 
OR of one or more literals [2]. A Boolean formula is in 3-conjunctive normal form 
or 3CNF, if each clause has exactly three distinct literals [2]. 

For example, the Boolean formula 


(a1V — a1V + z2) A (a3 V £2 V z4) A (> T1V — x3V — xa) 





is in 3CNF. The first of its three clauses is (11 V — x1V — 2), which contains the 
three literals 71, + zı, and — xg. In 3S AT, it is asked whether a given Boolean 
formula ¢ in 3CNF is satisfiable. 

It can be demonstrated that many problems belong to NP-complete using a 
polynomial time reduction from 3S AT [5]. For example, the well-known problem 
1-IN-3 8SAT which is defined as follows: Given a Boolean formula è in 3CNF, is 
there a truth assignment such that each clause in @ has exactly one true literal? 


2. RESULTS 
2.1. Simple Subset Product. 


Definition 2.1. SIMPLE SUBSET PRODUCT 

INSTANCE: A list of numbers L and a positive integer k with its prime factor- 
ization, such that the prime factorization of k does not contain any prime power 
with exponent greater than 1. 

QUESTION: Is there a subset of numbers from L whose product is k? 

We denote this problem as SSP. 


Theorem 2.2. SSP € NP-complete. 


Proof. Since SSP is only a restriction of problem SUBSET PRODUCT, then 
SSP € NP. Certainly, we can check whether the product of a subset of numbers 
from L is indeed k in polynomial time. We present a polynomial time reduction 
from 1-IN-3 3SAT to SSP. For a 3C NF formula ¢ with m clauses and n variables, 
we consider the sets 1.4, 51,5... Sn, Sn, € {1,...,m} such that S; + (resp., S;,;) 
is the set of the indices of the clauses (of @) that are satisfied by setting the it” 
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variable to true (resp., false). That is, if the i*” variable appears un-negated in the 
jt” clause then j € S; +, whereas if the i” variable appears negated in the j*” clause 
then j € S;,¢. Indeed, S; t U Si f equals the set of clauses containing an occurrence 
of the i!” variable, and the union of all these 2 x n sets equals {1,...,m}. Besides, 
we augment the universe with n additional elements and add the it” such element 
to both S; and S; f. Thus, the reduction proceeds as follows: 


(1) On input a 3CNF formula ¢ (with n variables and m clauses), the reduction 
computes the sets S14, 51,7 ..-Sn,t,Sn,f such that S;4 (resp., Si, f) is the 
set of the indices of the clauses in which the it” variable appears un-negated 
(resp., negated). 

(2) The reduction creates the sets S1... S2xn, where à = 1...n it holds that 
Soxi-1 = Sit U {m + i} and Sox; = i,f U {m + i}. 

(3) Establish a bijective mapping between the numbers {1,..., m+n} and the 
first m +n prime numbers. Replace the members inside of S4 ...S2xn with 
the mapped primes. 

(4) For each set in Si... S2xn multiply its members together, the resulting list 
of products is L for the SSP instance. Because prime numbers are used 
for the mapping in the previous step, the products are guaranteed to be 
equivalent iff the sets are equivalent by the unique factorization theorem 
[6]. 

(5) Multiply the first m+n primes p1,...,PDm+n together, the resulting product 
is the value & for the SSP instance. 

Note that (L, k) is a yes-instance of SSP iff ¢ is a yes-instance of 1-IN-3 8SAT. 
Assume, on the one hand, that $ has the certificate 71,...,7n. Then, for every 
j € {1,...,m} there exists an i € {1,...,n} such that setting the it variable to 
Ti, satisfies the jt” clause. Since, k does not contain prime powers with exponent 
greater than 1 then we guaranteed that exactly one literal is true per each clause. 

The transformation steps involves operations that are polynomial to the size of 
the input @. The first m + n primes can be generated in time O(m + n) using the 
sieve of Eratosthenes and are guaranteed to fit into O((m+n)? x In(m+n)) space 
by the prime number theorem [6]. 














2.2. Infinite Turing machines. 


Definition 2.3. A Turing machine is a quadruple M = (K,X,6,s). K is a finite 
set of states; s € K is the initial state. X is a finite set of symbols (we say X 
is the alphabet of M). We assume K and E are disjoint sets. © always contains 
the special symbols U and >: The blank and first symbol. Finally, 6 is a transition 
function, which maps K x £ to (KU{h, “yes”, “no” })xLx{,4,-}. We assume 
that h (the halting state), “yes” (the accepting state), “no” (the rejecting state), 
and the cursors directions — for “left”, + for “right” and — for “stay”, are not 
in kK UX. 


We can define the infinite Turing machine formally using the above notions. 


Definition 2.4. An infinite Turing machine is a quadruple M = (K, £, ô, s), where 
Z and s are exactly as an ordinary Turing machine, but the set K must contain 
infinite states. 6 is a transition function that must reflect the complexities of infinite 
states. Intuitively, 6 decides the next state as before, but also contains infinite 
actions (steps). Nevertheless, the final states are unique, that is there should be 
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only one h (the halting state), “yes” (the accepting state) and “no” (the rejecting 
state) in M. 


Definition 2.5. We define the bit length of a Turing machine M as the string 
length of the encoding of M by the binary alphabet of the Universal Turing machine. 
À Turing machine M has an infinite bit length if M is an infinite Turing machine. 
We say that a Turing machine has a finite bit length when M is an ordinary Turing 
machine. 


Theorem 2.6. Every language L can be decided by a Turing machine M of infinite 
bit length in polynomial time. 


Proof. Since the transition function of M could store infinite actions, then M could 
decide every x € L in linear time. 














Lemma 2.7. A language L is in P iff L can be decided by a Turing machine M 
of finite bit length in polynomial time. 











Proof. This is a consequence of the self definition of the class P. 
2.3. On P versus NP. 


Definition 2.8. COMPLEX SUBSET PRODUCT ON I-TH PRIME 

INSTANCE: A list of numbers L and a positive integer k with its prime factor- 
ization, such that the prime factorization of k does not contain any prime power 
with exponent greater than 1 and no prime factor is greater than the it” prime 
number. 

QUESTION: Is there a subset of numbers from L whose product is k? 

We denote this problem as CSP,. 





Theorem 2.9. For every positive integer i we have that CSP, € P. 


Proof. CSP, € P, since the target k will necessarily be equal to 2. Indeed, we 
would only need to check whether the list L contains the number 2. Assume for 
some natural number è > 1, we have CSP, € P. Let’s prove CSP;+1 € P. 

Suppose we have an instance (L, k) of CSP;,1. If the target k does not contain 
the (i +1) prime number, then (L, k) € CSP, iff (L, k) € CSP;11. If the target 
k is equal to the (i + 1)” prime number, then we replace the occurrence of the 
number p;+1 by the previous prime p; inside the list L and target k. The resulting 
instance (L’,k’) complies with (L’,k’) € CSP; iff (L, k) € CSPy41. 

Now, suppose the target k contains the prime factor p;}1 and another p; where 
obviously pj < pi+1. Thus we multiply each pair of numbers inside of L such 
that those number contains either p;+1 or p; but not both. After that, we remove 
those previous numbers that contains only the prime factor p;j+1 or pj but not 
both. Finally, we replace the product p;+1 x p; by the prime p; in the factorization 
of the elements in the resulting list. We do exactly the same with the target k 
which as well contains the product p;+1 x pj as a factor number of its factorization. 
The resulting instance (L’,k’) complies with (L’,k’) € CSP, iff (L,k) € CSP;41. 
Certainly, if (L, k) € CSP;+1 then every possible certificate of (L, k) contains a pair 
of numbers which has the corresponding prime factor p;}1 or p; or just one single 
number that contains both. 

In this way, we can reduce in polynomial time C'SP;,; to CSP;. Since P is 
closed under reductions and CSP; € P, it follows that C'SP;,1 € P. Hence, for 
every i € N we have proved CSP; € P by mathematical induction [6]. 
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Theorem 2.10. SSP € P. 


Proof. For every positive integer à the set C'S P; contains infinite elements. Indeed, 
the list L could vary into an infinite ways even though the target could be the same. 
Since for every positive integer è we have that CSP; € P, then every language C'S P; 
will have a Turing machine M;, and this machine decides each instance of C'S P; in 
polynomial time. We denote /(M;) as the bit length of M;. We also denote |C'SP,| 
as the cardinality of CSP;. Certainly, lim; 4 = 0 since there are infinite 
prime numbers, CSP, has infinite elements and the bit length of M; is finite because 
CSP, € P. Thus lim; oor = eno) = 0. Moreover, we can assure that 
lim; |CSP;| = [SSP|, because lim;,.. CSP; = SSP. Therefore, we obtain 
imig = 0. By the definition of M;, we have lim; sk M; = Mssp where 
Mssp could be a Turing machine which decides the problem instances of SSP in 
polynomial time. However, Mssp might have an infinite bit length. Indeed, Mssp 


has a finite bit length if SSP € P. Consequently, Pere = Ce =D 
(Mssp) _ 


Since SSP contains infinite elements and SSP 0, then Mssp has a finite bit 
length and thus SSP € P. 




















Theorem 2.11. P= NP. 


Proof. If any single NP-complete problem can be solved in polynomial time, then 
every NP problem has a polynomial time algorithm. Since SSP is complete for 
NP and SSP € P, it follows that P = NP. 














CONCLUSIONS 


After decades of studying the NP problems no one has been able to find a poly- 
nomial time algorithm for any of more than 300 important known NP-complete 
problems [5]. Even though this proof might not be a practical solution, it shows in 
a formal way that many currently mathematical problems can be solved efficiently, 
including those in NP-complete. At the same time, this demonstration would rep- 
resent a very significant advance in computational complexity theory and provide 
guidance for future research. On the one hand, it proves that most of the existing 
cryptosystems such as the public-key cryptography are not safe [8]. On the other 
hand, we might be able to find a formal proof for every theorem which has a proof 
of a reasonable length by a feasible algorithm. 
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